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such 



polynucleotides, and the use of such polynucleotides and 



inhibiting the production and function of the polypeptides of 
the present invention. 

The gastrointestinal tract is the most common site of 
both newly diagnosed cancers and fatal cancers occurring each 
year in the USA, figures are somewhat higher for men than for 
women. The incidence of colon cancer in the USA is 
increasing, while that of gastric cancer is decreasing, 
cancer of the small intestine is rare. The incidence of 
gastrointestinal cancers varies geographically. Gastric 
cancer is common in Japan and uncommon in the United States, 
whereas colon cancer is uncommon in Japan and common in the 
USA. An environmental etiologic factor is strongly suggested 
by the statistical data showing that people who move to a 
high-risk area assume the high risk. Some of the suggested 
etiologic factors for gastric cancer include aflatoxin, a 
carcinogen formed by aspergillus flavus and present in 
contaminated food, smoked fish, alcohol, and Vitamin A and 
magnesium deficiencies. A diet high in fat and low in bulk, 
and, possibly, degradation products of sterol metabolism may 



polypeptides . 



The present invention further relates to 



be the etiologic factors for colon cancer. Certain disorders 
may predispose to cancer, for exatt5)le, pernicious anetnia to 
gastric camcer, iintreated non- tropical sprue and imimine 
defects .to lymphoma and carcinoma, and ulcerative and 
granulomatous colitis, isolated polyps, and inherited 
familial polyposis to carcinoma of the colon . 

The most common tumor of the colon is adenomatous polyp. 
Primary lymphoma is rare in the colon cind most common in the 
small intestine. 

Adenomatous polyps are the most common benign 
gastrointestinal tximors. They occur throughout the GI tract, 
most commonly in the colon and stomach, and are foiind more 
frequently in males than in females. They may be single, or 
more commonly, multiple, and sessile or pedunculated. They 
may be inherited, as in familial polyposis and Gardener's 
syndrome, which primarily involves the colon. Development of 
colon cancer is common in familial polyposis. Polyps often 
cause bleeding, which may occult or gross, but rarely cause 
pain \mless con5)lications ensue. Papillary adenoma, a less 
common form found only in the colon, may also cause 
electrolyte loss and mucoid discharge. 

A malignant tumor includes a carcinoma of the colon 
which may be infiltrating or exopHytic and occurs most 
commonly in the rectosigmoid. Because the content of the 
ascending colon is liquid, a carcinoma in this area usually 
does not cause obstruction, but the patient tends to present 
late in the course of the disease with anemia, abdominal 
pain, or an abdominal mass or a palpable mass. 

The prognosis with colonic tumors depends on the degree 
of bowel walH-nvasion and on the presence of regional lyrrqph 
node involvement and distant metastases . The prognosis with 
carcinoma of the rectiam and descending colon is quite 
xinexpectedly good. Cure rates of 80 to 90% are possible with 
early resection before nodal invasion develops. For this 
reason, great care must be taken to exclude this disease when 



unexplained anemia, occult gastrointestinal bleeding, or 
change in bowel habits develop in a previously healthy- 
patient . Con5)lete removal of the lesion before it spreads to 
the lyTr5)h nodes provides the best chance of survival for a 
patient with cancer of the colon. Detection in an 
asytt^tomatic patient by occult -bleeding, blood screening 
results in the highest five year survival , 

Clinically suspected malignant lesions can usually be 
detected radiologically . Polyps less than 1 cm can easily be 
missed, especially, in the upper sigmoid and in the presence 
of diverticulosis. Clinically suspected and radiologically 
detected lesions in the esophagus, stomach or colon can be 
confirmed by fiber optjLc endoscopy combined with histologic 
tissue diagnosis made by directed biopsy and brush sitology. 
^: Colonoscopy is another method utilized to detect colon 

ijl diseases. Benign and malignant polyps not visualized by X- 

icay are often detected on colonoscopy. In addition, patients 
p with one lesion on X-ray often have additional lesions 

4^ detected on colonoscopy. Sigmoidoscope examination, however, 

■"^ only detects about 50% of colonic tumors. 

The cdDove methods of detecting colon cancer have 
drawbacks, for example, small colonic txamors may be missed by 
all of the above -described methods > The importance of 
detecting colon cancer is also extremely in5)ortant to prevent 
metastases . 

*QJ3 In accordance yf^tii an aspect of the present invention, 
there are provided nucleic acid probes coitprising nucleic 
acid molecules of sufficient length to specifically hybridize 
to the RNA transcribed from the human colon specific gene of 
the present invention or to DNA corresponding to such RNA. 

In accordance with another aspect of the present 
invention there is provided a method of and products for 
diagnosing colon cancer metastases by determining the 
presence of RNA transcribed from the human colon specific 
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gene of the present invention or DNA corresponding to such 
RNA in a saiiqole derived from a host. 

In accordance with yet another aspect of the present 
invention, there is provided a method of and products for 
diagnosing colon cancer metastases by detecting an altered 
level of a polypeptide corresponding to the colon specific 
gene of the present invention in a san?)le derived from a 
host, whereby an elevated level of the polypeptide indicates 
a colon cancer diagnosis. 

In accordance with another aspect of the present 
invention, there are provided isolated polynucleotides 
encoding a polypeptide of the present invention, including 
mRNAs, DNAs, cDNAs, genomic DNAs, as well as antisense 
analogs and biologically active and diagnostically or 
therapeutically useful fragments thereof. 

In accordance with a further aspect of the present 
invention, there are provided novel polypeptide encoded by 
the polynucleotides, as well as biologically active and 
diagnostically or therapeutically useful fragments, analogs 
and derivatives thereof. 

In accordance with yet a further aspect of the present 
invention, there is provided a process for producing such 
polypeptides by . recombinant techniques comprising culturing 
recombinant prokaryotic and/or eukaryotic host cells, 
containing a polynucleotide of the present invention, under 
conditions promoting expression of said proteins and 
subsequent recovery of said proteins. 

In accordance with yet a further aspect of the present 
invention, there are provided antibodies specific to such 
polypeptides . 

in accordance with another aspect of the present 
invention, there are provided processes for using the 
polypeptides of the present invention to treat colon cancer 
and for using the polypeptide to screen for cott?)Ounds which 
interact with the polypeptides, for exani>le, con5)Ounds which 



inhibit or activate the. receptor for the polypeptide of the 
present invention . 

In accordance with yet another aspect of the present 
invention, there is provided a process to screen for 
compounds which interact with the polypeptides, for exanple, 
coitpounds which inhibit or activate the polypeptides of the 
present invention . 

In accordance with yet a further aspect of the present 
invention, there are provided processes for utilizing such 
polypeptides, or polynucleotides encoding such polypeptides, 
for in vitro purposes related to scientific research, 
synthesis of DNA and mcuiufacture of DMA vectors. 

These and other afpects of the present invention should 
be apparent to those skilled in the art from the teachings 
herein . 

The following drawings are illustrative of embodiments 
of the invention and are not meant to limit the scope of the 
invention as encompassed by^theci^aims . 

Figure 1 shows the cDNA ^quence and the corresponding 
deduced amino acid sequence for the human colon specific gene 
disclosed in this application. The standard one -letter 
abbreviations for amino acids are used. 

The term "colon specific gene" means that such gene is 
primarily expressed in tissues derived from the colon, and 
such gene may be - expressed in cells derived from tissues 
other than from the colon. However, the expression of such 
gene is significantly higher in tissues derived from the 
colon than from non-colon tissues. 

In accordance with an aspect of the present invention, 
there are provided isolated nucleic acids (polynucleotides) 
which encode for the mature polypeptide having the deduced 
amino acid sequence of Figure 1. (SBQ ID NO:2) or for the 
mature polypeptide encoded by the cDNA of the clone (s) 
deposited as ATCC Deposit No. 97129 on April 28, 1995. 
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A polynucleotide encoding the colon specific gene of the 
present invention was isolated from a human colon cancer cDNA 
library. The polynucleotide contains an open reading frame 
encoding' a protein of 158 amino acid residues. The 
polypeptide exhibits structural homology to a galactose 
specific lectin from a diamondback rattlesnake with 36% 
identity and 54% similarity over a 125 amino acid stretch and 
30% identity and 52% similarity to a human pancreatic stone 
protein precursor. 

The polynucleotides of the present invention may be in 
the form of RNA or in the form of DNA, which DNA includes 
cDNA, genomic DNA, and synthetic DNA. The DNA may be double- 
stranded or single -stranded, and if single stranded may be 
the coding strand or non-coding (anti-sense) strand. The 
coding sequence which encodes the mature polypeptides may be 
identical to the coding sequence shown in Figures 1 (SEQ ID 
NO:l) or that of the deposited clone (s) or may be a different 
coding sequence which coding sequence, as a result of the 
redundancy or degeneracy of the genetic code, encodes the 
same mature polypeptides as the DNA of Figure 1 (SEQ ID NO:l) 
or the deposited cDNA. 

The polynucleotides which encode for the mature 
polypeptide of Figure 1 (SEQ ID N0:^2) or for the mature 
polypeptide encoded by the deposited cDNA may include: only 
the coding sequence for the mature polypeptide; the coding 
sequence for the mature polypeptide and additional coding 
sequence such as a leader or secretory sequence or a 
proprotein sequence; the coding sequence for the mature 
polypeptide (and optionally additional coding sequence) and 
non-coding sequence, such as introns or non-coding sequence 
5' and/or 3' of the coding sequence for the mature 
polypeptides . 

Thus, the term "polynucleotide encoding a polypeptide" 
encott5)asses a polynucleotide which includes only coding 



sequence for the polypeptide as well as a polynucleotide 
which includes additional coding and/or non-coding sequence. 

The present invention further relates to variants of the 
hereinsQjove described polynucleotides which encode for 
fragments, analogs and derivatives of the polypeptide having 
the deduced amino acid sequence of Figure 1 (SEQ ID. NO: 2) or 
the polypeptide encoded by the cDNA of the deposited 
clone (s). The variants of the polynucleotides may be a 
naturally occurring allelic variant of the polynucleotides or 
a non-naturally occurring variant of the polynucleotides. 

Thus, the present invention includes polynucleotides 
encoding the same mature polypeptide as shown in Figures 1 
(SEQ ID NO: 2) or the s^me mature polypeptide encoded by the 
cDNA of the deposited clone (s) as well as variants of such 
polynucleotides which variants encode for a fragment, 
derivative or analog of the polypeptide of Figure 1 (SEQ ID 
NO: 2) or the polypeptide encoded by the cDNA of the deposited 
clone (s). Such nucleotide variants include deletion 

variants, substitution variants and addition or insertion 
variants . 

As hereinabove indicated, the polynucleotides may have 
a coding sequence which is a naturally occurring allelic 
variant of the , coding sequence shown in Figure 1 (SEQ ID 
N0:1) or of the coding sequence of the deposited clone (s) . 
As known in the art, an allelic variant is an alternate form 
of a polynucleotide sequence which may have a substitution, 
deletion or addition of one or more nucleotides, which does 
not substantially alter the function of the encoded 
polypeptide . 

The present invention also includes polynucleotides, 
wherein the coding sequence for the mature polypeptides may 
be fused in the same reading frame to a polynucleotide 
sequence which aids in expression and secretion of a 
polypeptide from a host cell, for example, a leader sequence 
which functions as a secretory sequence for controlling 



trcuisport of a polypeptide from the cell. The polypeptide 
having a leader sequence is a preprotein and may have the 
leader sequence cleaved by the host cell to form the mature 
form of the polypeptide. The polynucleotides may also encode 
for a proprotein which is the mature protein plus additional 
5' amino acid residues. A mature protein having a 
prosequence is a proprotein and is an inactive form of the 
protein. Once the prosequence is cleaved an active mature 
protein remains. Thus, for exair^jle, the polynucleotides of 
the present invention may encode for a mature protein, or for 
a protein having a prosequence or for a protein having both 
a prosequence and a presequence (leader sequence) . 

The polynucleotides of the present invention may also 
have the coding sequence fused in frame to a marker sequence 
which allows for purification of the polypeptide of the 
present invention. The marker sequence may be a hexa- 
histidine tag supplied by a pQE-9 vector to provide for 
purification of the mature polypeptide fused to the marker in 
the case of a bacterial host, or, for exanqple, the marker 
sequence may be a hemagglutinin (HA) tag when a mammalian 
host, e.g. COS -7 cells, is used. The HA tag corresponds to 
an epitope derived from the influenza hemagglutinin protein 
(Wilson, I., et.al.. Cell, 37:767 (1984)). 

The term "gene" means the segment of DNA involved in 
producing a polypeptide chain; it includes regions preceding 
and following the coding region (leader and trailer) as well 
as intervening sequences (introns) between individual coding 
segments (exons) . 

Fragments of the full length colon specific gene may be 
used-as a hybridization probe for a cDNA library to isolate 
the full length gene and to isolate other genes which have a 
high sequence similarity to the gene or similar biological 
activity. Probes of this type preferably have at least 30 
bases and may contain, for exan?>le, 50 or more bases. The 
probe may also be used to identify a cDNA clone corresponding 



to a full length transcript and a genomic clone or clones 
that contain the cotr5)lete coloii specific gene including 
regulatory and promoter regions, exons, and introns . An 
example of a screen comprises isolating the coding region of 
the gene by using the known DNA sequence to synthesize an 
oligonucleotide probe. Labeled oligonucleotides having a 
sequence cott^lementary to that of the gene of the present 
invention are used to screen a library of human cDNA, genomic 
DNA or mRNA to determine which members of the library the 
probe hybridizes to. 

The present invention further relates to 
polynucleotides which hybridize to the hereinabove-described 
sequences if there is dt least 70%, preferably at least 90%, 
and more prefereibly at least 95% identity between the 
sequences. The present invention particularly relates to 
polynucleotides which hybridize under stringent conditions to 
the hereinabove-described polynucleotides. As herein used, 
the term "stringent conditions" means hybridization will 
occur only if there is at least 95% and preferably at least 
97% identity between the sequences. The polynucleotides 
which hybridize to the hereinabove described polynucleotides 
in a preferred embodiment encode polypeptides which either 
retain substantially the same biological fiinction or activity 
as the mature polypeptide encoded by the cDNAs of Figure 1 
(SEQ ID NO:l) or the deposited cDNA(s) . 

Alternatively, the polynucleotide may have at least 20 
bases, preferably 30 bases, and more preferably at least 50 
bases which hybridize to a polynucleotide of the present 
invention and which has an identity thereto, as hereinabove 
described, and which may or may not retain activity. For 
exanqple, such polynucleotides may be eit^loyed as probes for 
the polynucleotide of Figure 1 (SEQ ID NO:l) , for example, 
for recovery of the polynucleotide or as a diagnostic probe 
or as a PGR primer. 



Thus, the present invention is directed to 
polynucleotides having* at least a 70% identity, preferably at 
least 90% and more preferably at least a 95% identity to a 
polynucleotide which encodes the polypeptide of Figure 1 (SEQ 
ID NO: 2) as well as fragments thereof, which fragments have 
at least 30 bases and preferably at least 50 bases and to 
polypeptides encoded by such polynucleotides. 

The deposit (s) referred to herein will be maintained 
under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Micro-organisms for purposes of 
Patent Procedure. These deposits are provided merely as 
convenience to those of skill in the art and are not an 
admission that a depo^dt is required under 35 U.S.C. §112. 
The sequence of the polynucleotides contained in the 
deposited materials, as well as the amino acid sequence of 
the polypeptides encoded thereby, are incorporated herein by 
reference and are controlling in the event of any conflict 
with any description of sequences herein. A license may be 
required to make, use or sell the deposited materials, and 
no such license is hereby granted. 

In accordance with another aspect of the present 
invention there are provided polynucleotides which are 
preferably at ieast 10 base pairs ^. in length and which 
hybridize to and are at least 70% identical to RNA (or 
corresponding DNA)^ transcribed from a human gene which has a 
coding sequence which is at least 90% identical to coding 
sequence of the DNA sequence of Figure 1 (SEQ ID N0:1) . 

Thus, the polynucleotide sequences which hybridize as 
described above may be used to hybridize to and detect the 
expression of the human gene to which they correspond for use 
in diagnostic assays as hereinafter described. 

In accordance with still another aspect of the present 
invention there are provided diagnostic assays for detecting 
micrometastases of colon cancer in a host. While applicant 
does not wish to limit the reasoning of the present invention 
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to any specific scientific theory, it is believed that the 
presence of active transcription of a colon specific gene of 
the present invention in cells of the host, other than those 
derived from the colon, is indicative of colon cancer 
metastases. This- is true because, while the colon specific 
gene are foxind in all cells of the body, their transcription 
to mRNA, cDNA and expression products is primarily limited to 
the colon in non-diseased individuals. However, if colon 
cancer is present, colon csincer cells migrate from the cancer 
to other cells, such that these other cells are now actively 
transcribing and expressing a colon specific gene at a 
greater level than is normally found in non-diseased 
individuals, i.e., transcription is higher than found in non- 
colon tissues in healthy individuals. It is the detection of 
this enhanced transcription or enhanced protein expression in 
cells, other than those derived from the colon, which is 
indicative of metastases of colon cancer . 

In one exatr5)le of such a diagnostic assay, an RNA 
sequence in a sanple derived from a ^ tissue other than the 
colon is detected by hybridization to a probe. The san5)le 
contains a nucleic acid or a mixture of nucleic acids, at 
least one of which is suspected of containing RNA (or 
corresponding cDNA) transcribed from ^a human colon specific 
gene of the present invention. Thus, for exan^le, in a form 
of an assay for determining the presence of a specific RNA in 
cells, initially RNA is isolated from the cells. 

A saTt5)le may be obtained from cells derived from tissue 
other than from the colon including but not limited to blood, 
urine, saliva, tissue biopsy and autopsy material- The use 
of such methods for detecting enhanced transcription to mRNA 
from a human colon specific gene of the present invention or 
fragment thereof in a saii5)le obtained from cells derived from 
other than the colon is well within the scope of those 
skilled in the art from the teachings herein. 




The isolation of mRNA. con5)rises isolating total cellular 
RNA by disrupting a cell and performing differential 
centrifugation. Once the total RNA is isolated, mRNA is 
isolated by making use of the adenine nucleotide residues 
known to those skilled in the art as a poly (A) tail found on 
virtually every eiikaryotic mRNA molecule at the 3' end 
thereof. Oligonucleotides composed of only deoxythymidine 
[oligo(dT)] are linked to cellulose and the oligo(dT)- 
cellulose packed into small columns. When a preparation of 
total cellular RNA is passed through such a column, the mRNA 
molecules bind to the oligo (dT) by the poly (A) tails while the 
rest of the RNA flows through the column. The bound mRNAs 
are then eluted from tjie column and collected. 

One exait5)le of detecting isolated mRNA transcribed from 
a colon specific gene of the present invention or a fragment 
thereof which encodes for a polypeptide of the present 
invention, comprises screening the collected mRNAs with 
specific oligonucleotide probes which have been custom 
designed to hybridize to the mRNA to be detected. The 
oligonucleotide probe coii?)rises a polynucleotide sequence 
which hybridizes to at least a portion of the mRNA (or cDNA 
produced from such RNA) transcribed from one or more of the 
colon specific gene of the present reinvention or fragment 
thereof. The polynucleotide sequences are at least 70% 
identical to and hybridize to mRNA (or cDNA produced from 
such RNA) transcribed from a hximan colon specific gene of the 
present invention having exons which includes DNA having at 
least 90%, preferably at least 95% identity and most 
preferably at least 97% identity to the DNA sequence of 
Figure 1 (SEQ ID NO:l) . 

It is also appreciated that such probes can be and are 
preferably labeled with an analytically detectable reagent to 
facilitate identification of the probe. Useful reagents 
include but are not limited to radioactivity, fluorescent 
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dyes or enzymes capable of catalyzing the formation of a 
detectable product. 

An example of detecting a polynucleotide complementary 
to the mRNA sequence (cDNA) utilizes the polymerase chain 
reaction* (PCR) in conjunction with reverse transcriptase. 
PCR is a very powerful method for the specific attplif ication 
of DNA or RNA stretches (Saiki et al . , Nature, 234:163-166 
(1986))- One application of this technology is in nucleic 
acid probe technology to bring up nucleic acid sequences 
present in low copy numbers to a detectable level . Numerous 
diagnostic and scientific applications of this method have 
been described by H.A. Erlich (ed.) in PCR Technology- 
Principles and ApplicaJ:ions for DNA Airplif ication, Stockton 
Press, USA, 1989, and by M.A. Inis (ed.) in PCR Protocols, 
Academic Press, San Diego, USA, 1990. 

RT-PCR is a combination of PCR with the reverse 
transcriptase enzyme. Reverse transcriptase is an enzyme 
which produces cDNA molecules from corresponding mRNA 
molecules. This is important since PCR an5)lifies nucleic 
acid molecules, particularly DNA, and this DNA may be 
produced from the mRNA isolated from a sample derived from 
the host- 

A specific example of an RT^PCR diagnostic assay 
involves removing a san5)le from a tissue of a host. Such a 
sample will be from a tissue, other than the colon, for 
exatrple, blood. Therefore, an example of such a diagnostic 
assay coTt5)rises whole blood gradient isolation of nucleated 
cells, total RNA extraction, RT-PCR of total RNA and agarose 
gel electrophoresis of PCR products. The PCR products 
coti?)rise cDNA complementary to RNA transcribed from the colon 
specific gene of the present invention or fragments thereof. 
More particularly, a blood sarr5)le is obtained eind the whole 
blood is combined with an equal volume of phosphate buffered 
saline, centrifuged and the lyttphocyte and granulocyte layer 
is carefully aspirated and rediluted in phosphate buffered 



saline and centrifuged again. The supernate is discarded and 
the pellet containing nucleated cells is used for RNA 
extraction using the RNazole B method as described by the 
manufacturer (Tel -Test Inc., Friendswood, TX) . 

Oligonucleotide primers and probes are prepared with 
high specificity to the DMA sequences of the present 
invention. The probes are at. least 10 base pairs in length, 
preferably at least 30 base pairs in length and may be at 
least 50 base pairs in length or more. The reverse 
transcriptase reaction and PGR an^jlif ication are performed 
sequentially without interruption. Taq polymerase is used 
during PGR and the PGR products are concentrated and the 
entire satt^jle is runi on a Tris -borate -EDTA agarose gel 
containing ethidium bromide. 

In accordance with another aspect of the present 
invention, there is provided a method of diagnosing a 
disorder of the colon, for example colon cancer, by 
determining altered levels of the colon specific polypeptides 
of the present invention in a biological sample, derived from 
tissue other than from the colon. Elevated levels of the 
colon specific polypeptides of the present invention, 
indicates active transcription and expression of the 
corresponding colon specific gene product. Assays used to 
detect levels of a colon specific gene polypeptide in a 
sample derived from a host are well-known to those skilled in 
the art and include radioimmunoassays, competitive -binding 
assays. Western blot analysis, BLISA assays and "sandwich" 
assays. A biological sample may include, but is not limited 
to. tissue extracts, cell satr?)les or biological fluids, 
however, "-in accordance with the present invention, a 
biological sair?)le specifically does not include tissue or 
cells of the colon. 

An ELISA assay (Coligan, et al.. Current Protocols in 
Tmmunoloav . 1(2), Chapter 6, 1991) initially comprises 
preparing an antibody specific to a colon specific 
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polypeptide of the present invention, preferably a monoclonal 
antibody. In addition, a reporter antibody is prepared 
against tlie monoclonal antibody. To the reporter antibody is 
attached . a detectable reagent . such as radioactivity, 
fluorescence or, in this example, a horseradish peroxidase 
enzyme . A sample is removed from a host and incubated on a 
solid support, e.g., a polystyrene dish, that binds the 
proteins in the sample. Any free protein binding sites on 
the dish are then covered by incubating with a non-specific 
protein, such as BSA. Next, the monoclonal antibody is 
incubated in the dish during which time the monoclonal 
antibodies attach to the colon specific polypeptide attached 
to the polystyrene disli. All unboxand monoclonal antibody is 
washed out with buffer. The reporter antibody linked to 
horseradish peroxidase is now placed in the dish resulting in 
binding of the reporter antibody to any monoclonal antibody 
bound to the colon specific gene polypeptide. Unattached 
reporter antibody is then washed out. Peroxidase substrates 
are then added to the dish and the amount of color developed 
in a given time period is a measurement of the amount of the 
colon specific polypeptide present in a given volimae of 
patient sample when con^ared against a standard curve. 

A con5)etition assay may be etrployed where antibodies 
specific to a colon specific polypeptide are attached to a 
solid support. The colon specific polypeptide is then 
labeled and the labeled polypeptide a san5)le derived from the 
host are passed over the solid support and the amount of 
label detected, for example, by liquid scintillation 
chromatography, can be correlated to a quantity of the colon 
specific polypeptide in the saii?)le. , 

A "Scuidwich" assay is similar to an ELISA assay. In a 
"sandwich" assay, colon specific polypeptides are passed over 
a solid support and bind to antibody attached to the solid 
support. A second antibody is then bound to the colon 
specific polypeptide. A third antibody which is labeled and 
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is specific to the second, antibody, is then passed over the 
solid support and binds to the second antibody and an amount 
can then be quantified. 

In alternative methods, labeled antibodies to a colon 
specific polypeptide are used. In a one-step assay, the 
target molecule, if it is present, is immobilized and 
incubated with a labeled antibody. The labeled antibody 
binds to the immobilized target molecule. After washing to 
remove the unboxand molecules, the sair^le is assayed for the 
presence^ of the label. In a two-step assay, immobilized 
target molecule is incubated with an \inlabeled antibody. The 
target molecule -labeled antibody cotr^lex, if present, is then 
bound to a second, Icib^led antibody that is specific for the 
unlabeled antibody. The sanple is washed and assayed for the 
presence of the label. 

The choice of marker used to label the antibodies will 
vary depending upon the application. However, the choice of 
marker is readily determinable to one skilled in the art. 
These labeled antibodies may be used in immunoassays as well 
as in histological applications to detect the presence of the 
proteins. The labeled antibodies may be polyclonal or 
monoclonal . 

The presence of active transcription, which is greater 
than that normally found, of the colon specific gene in cells 
other than from the colon, by the presence of an altered 
level of mRNA, cDNA or expression products is an important 
indication of the presence of a colon cancer which has 
metastasized, since colon cancer cells are migrating from the 
colon into the general circulation. Accordingly, this 
phenomenon may have important clinical implications since the 
method of treating a localized, as opposed to a metastasized, 
tumor is entirely different. 

The assays described above may also be used to test 
whether bone marrow preserved before chemotherapy is 
contaminated with micrometastases of a colon cancer cell. In 



the assay, blood cells from the bone marrow are isolated and 
treated as described above, this method allows one to 
determine whether preserved bone marrow is still suitable for 
transplantation after chemotherapy. 

Antibodies specific to the colon specific polypeptide, 
for example monoclonal antibodies, may also be used to target 
colon cancer cells, for exair5)le, in a method of homing 
interaction agents which, when contacting colon cancer cells, 
destroy them. This is true since the antibodies are specific 
for colon specific polypeptides which are primarily expressed 
in the colon, and a linking of the interaction agent to the 
antibody would cause the interaction agent to be carried 
directly to the prostafe . 

Antibodies of this type may also be used to do in vivo 
imaging, for example, by labeling the antibodies to 
facilitate scanning of the pelvic area and the colon. One 
method for imaging cotrprises contacting any tumor cells of 
the colon to be imaged with ah anti- colon specific antibody 
labeled with a detectable marker. The method is performed 
under conditions such that the labeled antibody binds to any 
colon specific polypeptides. In a specific exaTt5)le, the 
antibodies interact with the colon, for example, colon cancer 
cells, and fluoresce upon such contact such that imaging and 
visibility of the colon is enhanced to allow a determination 
of the diseased or non- diseased state of the colon. 

The present invention further relates to a colon 
specific gene polypeptide which has the deduced amino acid 
sequence of Figure 1 (SEQ ID NO: 2) or which have the amino 
acid sequences encoded by the deposited cDNA(s) , as well as 
fragments, analogs and derivatives of such polypeptide. 

The terms "fragment," "derivative" and "analog" when 
referring to the polypeptide of .Figure 1 (SEQ ID NO: 2) or 
that encoded by the deposited cDNA(s) , means a polypeptide, 
which retains essentially the same biological function or 
activity as such polypeptide. Thus, an analog includes a 
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proprotein which can be activated by cleavage of the 
proprotein portion to produce an active mature polypeptide. 

The polypeptides of the present invention may be 
recombinant polypeptides, natural polypeptides or synthetic 
polypeptides, preferably recombinant polypeptides. 

The fragment, derivative or analog of the polypeptides 
of Figure 1 (SEQ ID N0:2) or that encoded by the deposited 
cDNA(s) may be (i) one in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved 
amino acid residue (preferably a conserved amino acid 
residue) and such substituted amino acid residue may or may 
not be one encoded by the genetic code, or (ii) one in which 
one or more of the amirip acid residues includes a substituent 
group, or (iii) one in which the polypeptide is fused with 
another con^jound, such as a compound to increase the half- 
life of the polypeptide (for exaii5)le, polyethylene glycol) , 
or (iv) one in which the additional amino acids are fused to 
the polypeptide, such as a leader or secretory sequence or a 
sequence which is ett5)loyed for purification of the mature 
polypeptide or a proprotein sequence. Such fragments, 
derivatives and analogs are deemed to be within the scope of 
those skilled in the art from the teachings herein. 

The polypeptides and polynucleotides of the present 
invention are preferably provided in an isolated form, and 
preferably are purified to homogeneity. 

The term "isolated" means that the material is removed 
from its original environment (e.g., the natural environment 
if it is naturally occurring). For example, a naturally- 
occurring polynucleotide or polypeptide present in a living 
animal is not isolated, but the same polynucleotide or 
polypeptide, separated from some or all of the coexisting 
materials in the natural system, is isolated. Such 
polynucleotides could be part of a vector and/or such 
polynucleotides or polypeptides could be part of a 
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cott^josition, and still be isolated in that such vector or 
coit5)osition is not part of its natural environment. 

The polypeptides of the present invention include the 
polypeptide of Figure 1 (SEQ ID N0:2) (in particular the 
mature polypeptide) as well as polypeptides which have at 
least 70% similarity (preferably at least 70% identity) to 
the polypeptide of Figure 1 (SEQ ID NO: 2) and more preferably 
at least 90% similarity (more preferably at least 90% 
identity) to the polypeptide of Figure 1 (SEQ ID NO: 2) and 
still more preferably at least 95% similarity (still more 
preferably at least 90% identity) to the polypeptide of 
Figure 1 (SEQ ID NO: 2) and also include portions of such 
polypeptides with suchj portion of the polypeptide generally 
containing at least 30 amino acids and more preferably at 
least 50 amino acids. 

AS Icnown in the art "similarity" between two 
polypeptides is determined by comparing the amino acid 
sequence and its conserved amino acid substitutes of one 
polypeptide to the sequence of a second polypeptide. 

Fragments or portions of the polypeptides of the present 
invention may be en5)loyed for producing the corresponding 
full-length polypeptide by peptide synthesis; therefore, the 
fragments may be employed as intermediates for producing the 
full-length polypeptides. Fragments or portions of the 
polynucleotides of the present invention may be used to 
synthesize full-length polynucleotides of the present 
invention . 

The present invention also relates to vectors which 
include polynucleotides of the present invention, host cells 
which are genetically engineered with vectors of the 
invention and the production of polypeptides of the invention 
by recombinant techniques. 

Host cells are genetically engineered (transduced or 
transformed or transfected) with the vectors of this 
invention which may be, for example, a cloning vector or an 
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expression vector. The vector may be, for example, in the 
form of a plasmid, a" viral particle, a phage, etc. The 
engineered host cells can be cultured in conventional 
nutrient media modified as appropriate for activating 
promoters, selecting transf ormants or aTt?)lifying the colon 
specific genes . The culture conditions , such as tetr^jerature , 
pH and the like, are those previously used with the. host cell 
selected for expression, and will be apparent to those of 
ordinarily skill in the art. 

The polynucleotides of the present invention may be 
employed for producing polypeptides by recombinant 
techniques. Thus, for example, the polynucleotide may be 
included in any one of a variety of esipression vectors for 
expressing a polypeptide. Such vectors include chromosomal, 
nonchromosomal and synthetic DNA sequences, e.g., derivatives 
of SV40; bacterial plasmids; phage DNA; baculovirus; yeast 
plasmids; vectors derived from combinations of plasmids and 
phage DNA, viral DNA such as vaccinia, adenovirus, fowl pox 
virus, and pseudorabies . However, any other vector may be 
used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the 
vector by a variety of procedures. In general, the DNA 
sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. Such 
procedures and others are deemed to be within the scope of 
those skilled in the art. 

The DNA sequence in the eiqjression vector is operatively 
linked to an appropriate expression control sequence (s) 
(promoter) to direct mRNA synthesis. As representative 
exsaaples of such promoters, there may be mentioned: LTR or 
SV40 promoter, the B- coli. lac or tr^, the phage lambda 
promoter and other promoters known to control expression of 
gene in prokaryotic or eukaryotic cells or their viruses. 
The expression vector also contains a ribosome binding site 
for translation initiation and a transcription terminator. 
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The vector may also include appropriate sequences for 
amplifying expression. 

In addition, the expression vectors preferably contain 
one or more selectable marker gene to provide a phenotypic 
trait for selection of treuisformed host cells such as 
dihydrof olate reductase or neomycin resistance for eukaryotic 
cell culture, or such as tetracycline or att^jicillin 
resistance in B. coli . 

The vector containing the appropriate DNA sequence as 
hereinabove described, as well as an appropriate promoter or 
control sequence, may be employed to transform an appropriate 
host to permit the host to express the protein. 

As representative} exan5>les of appropriate hosts, there 

may be mentioned: bacterial cells, such as E^^ coli, 

M streptomvces . Salmonella t yphimurium ; fungal cells, such as 

fn yeast; insect cells such as Drosoohila S2 and Spodoptera Sf9; 

ni animal cells such as CHO, COS or Bowes melanoma; 

adenoviruses; plant cells, etc. The selection of an 
appropriate host is deemed to be within the scope of those 
skilled in the art from the teachings herein. 
L More particularly, the present invention also includes 

recombinant constructs comprising one or more of the 
sequences as broadly described abbve. The constructs 
comprise a vector, such as a plasmid or viral vector, into 
which a sequence of the invention has been inserted, in a 
forward or reverse orientation. In a preferred aspect of 
this embodiment, the construct further comprises regulatory 
sequences, including, for exavaple. a promoter, operably 
linked to the sequence. Large numbers of suitable vectors 
and promoters are known to those of skill in the art, and are 
commercially available. The following vectors are provided 
by way of exan5>le. Bacterial: pQB70, pQB60, pQE-9 (Qiagen) , 
pBS, pDlO, phagescript, psixi74, pbluescript SK, pBSKS, 
pNH8A, pNHl6a, pNHlSA, pNH46A (Stratagene) ; .ptrc99a, pKK223- 
3, PKK233-3, pDR540, pRITS (Pharmacia). Eukaryotic: pWLNEO, 
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pSV2CAT, pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, 
pSVL (Pharmacia) . However, any other plasmid or vector may 
be used as long as they are replicable and viable in the 
■ host . 

Promoter regions can be selected from any desired gene 
using CAT (chloran?)henicol transferase) vectors or other 
vectors with selectable markers. Two appropriate vectors are 
pKK232-8 and pCM7 . Particular named bacterial promoters 
include lad, lacZ, T3, T7, gpt, lambda Pr, Pl and trp. 
Eukaryotic promoters include CMV immediate early, HSV 
thymidine kinase, early and late SV40, LTRs from retrovirus, 
and mouse metallothionein-I. Selection of the appropriate 
vector and promoter ig well within the level of ordinary 

skill in the art. 

In a further embodiment, the present invention relates 
to host cells containing the above -de scribed constructs. The 
host cell can be a higher eukaryotic cell, such as a 
mammalian cell, or a lower eukaryotic cell, such as a yeast 
cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the construct into the host 
cell can be effected by calcium phosphate transf ection, DEAE- 
Dext ran mediated trans feet ion, or electroporation (Davis, L., 
Dibner, M. , Battey, I., Basic Methods', in Molecular Biology, 
(1986)). 

The constructs in host cells can be used in a 
conventional manner to produce the gene product encoded by 
the recombinant sequence. Alternatively, the polypeptides of 
the invention can be synthetically produced by conventional 
peptide synthesizers. 

Proteins can be expressed in matmnalian cells, yeast, 
bacteria, or other cells under the control of appropriate 
promoters. Cell-free translation systems can also be 
eni)loyed to produce such proteins using RNAs derived from the 
DNA constructs of the present invention. Appropriate cloning 
and expression vectors for use with prokaryotic and 
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eiikaryotic hosts are described by Sambrook, et al . , Molecular 
Cloning: A Laboratory Manual, Second Edition, Cold Spring 
Harbor, N.Y., (1989), the disclosure of which is hereby 
incorporated by reference. 

Transcription of the DNA encoding the polypeptides of 
the present invention by higher exikaryotes is increased by 
inserting an enhancer sec[uence into the vector. Enhancers 
are cis -acting elements of DNA, usually about from 10 to 300 
bp that act on a promoter to increase its transcription. 
Exartqples including . the SV40 enhancer on the late side of the 
replication origin bp 100 to 270, a cytomegalovirus early 
promoter enhancer, the polyoma enhancer on the late side of 
the replication origins and adenovirus enhancers. 

Generally, recombinant expression vectors will include 
origins of replication and selectable markers permitting 
transformation of the host cell, e.g., the ati?>icillin 
resistance gene of E. coli and S . cerevisiae TRPl gene, and 
a promoter derived from a highly-expressed gene to direct 
transcription of a downstream structural sequence. Such 
promoters can be derived from operons encoding glycolytic 
enzymes such as 3 -phosphoglycerate kinase (PGK) , of-f actor, 
acid phosphatase, or heat shock proteins, among others. The 
heterologous structural sequence is assembled in appropriate 
phase with translation initiation and termination sequences. 
Optionally, the heterologous sequence can encode a fusion 
protein including an N-terminal identification peptide 
imparting desired characteristics, e.g., stabilization or 
siirplif ied purification of expressed recombinant product. 

Useful expression vectors for bacterial use are 
constructed by inserting a structural DNA sequence encoding 
a desired protein together with suitable translation 
initiation and termination signals in operable reading phase 
with a ftuictional promoter. The vector will cottqprise one or 
more phenotypic selectable markers and an origin of 
replication to ensure maintenance of the vector and to, if 
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desirable, provide atnplif ication within the host. Suitable 
prokaryotic hosts for transformation include E. coli . 
Bacillus subtilis . Salmonella tvphimurium and various species 
within ,the genera Pseudomonas , Streptomyces, and 
Staphylococcus, although others may also be eitqployed as a 
matter of choice. 

As a representative but nonlimiting example, useful 
expression vectors for bacterial use can comprise a 
selectable marker and bacterial origin of replication derived 
from commercially . available plasmids cottprising genetic 
elements of the well known cloning vector pBR322 (ATCC 
37017) . Such commercial vectors include, for exan5)le, 
pKK223-3 (Pharmacia Fiiie C!hemicals, Uppsala, Sweden) and GEMl 
(Promega Biotec, Madison, WI, USA) . These pBR322 "backbone" 
sections are combined with an appropriate promoter and the 
structural sequence to be expressed. 

Following transformation of a suitable host strain and 
growth of the host strain to an appropriate cell density, the 
selected promoter is induced by appropriate means (e.g., 
teit^jerature shift or chemical induction) and cells are 
cultured for an additional period . 

Cells are typically harvested by centrif ugation, 
disrupted by physical or chemical means, and the resulting 
crude extract retained for further purification. 

Microbial cel-ls employed in expression of proteins can 
be disrupted by any convenient method, including freeze-thaw 
cycling, sonication, mechanical disruption, or use of cell 
lysing agents, such methods are well know to those skilled in 
the art. 

Various mammalian cell culture systems can also be 
etrployed to express recombinant protein. Examples of 
mammalian expression systems include the COS -7 lines of 
monkey kidney fibroblasts, described by Gluzman, Cell, 23:175 
(1981) , and other cell lines capable of expressing a 
conpatible vector, for exattqple, the C127, 3T3, CHO, HeLa and 
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BHK cell lines. Maintnalian expression vectors will cott5>rise 
an origin of replication, a suitable promoter and enhancer, 
and also any necessary ribosome binding sites, 
polyadenylation site, splice donor and acceptor sites, 
transcriptional termination sequences, and 5' flanking 
nontranscribed sequences. DNA sequences derived from the 
SV40 splice, and polyadenylation sites may be used to provide 
the required nontranscribed genetic elements. 

The colon specific gene polypeptide can be recovered and 
purified from recombinant cell cultures by methods including 
ammonium sulfate or ethanol precipitation, acid extraction, 
anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrc4>hobic interaction chromatography, 
. affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, 
as necessary, in cott5)leting configuration of the mature 
Si protein. Finally, high performance liquid chromatography 

(HPLC) can be employed for final purification steps. 

The polynucleotides of the present invention may have 
the coding sequence fused in frame to a marker sequence which 
allows for purification of the polypeptide of the present 
invention. Rn exairqple of a marker sequence is a hexa- 
histidine tag which may be supplied by a vector, preferably 
a pQE-9 vector, which provides for purification of the 
polypeptide fused -to the marker in the case of a bacterial 
host, or, for exaxaple, the marker sequence may be a 
hemagglutinin (HA) tag when a mammalian host, e.g. COS -7 
cells, is used. The HA tag corresponds to an epitope derived 
from the influenza hemagglutinin protein (Wilson, I., et al.. 

Cell, 37:76=? (1984) ) . 

The polypeptide of the present invention may be a 
naturally purified product, or a product of chemical 
synthetic procedures, or produced by recombinant techniques, 
from a prokaryotic or eukaryotic host (for exairi)le, by 
bacterial, yeast, higher plant, insect and mammalian cells in 
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culture) . Depending upon the host en5)loyed in a recombinant 
production procedure, the polypeptide of the present 
invention may be glycosylated or may be non-glycosylated. 
Polypeptides of the invention may also include an initial 
methionine amino acid residue. 

In accordance with another aspect of the present 
invention there are provided assays which may be used to 
screen for therapeutics to inhibit the action of the colon 
specific gene or colon specific protein of the present 
invention. One assay takes advantage of the reductase 
function of these proteins. The present invention discloses 
methods for selecting a therapeutic which forms a cotiqplex 
with colon specific genaa proteins with sufficient affinity to 
prevent their biological action. The methods include various 
assays, including competitive assays where the proteins are 
iitimobilized to a support, and are contacted with a natural 
substrate and a labeled therapeutic either simultaneously or 
in either consecutive order, and determining whether the 
therapeutic effectively cott^jetes with the natural substrate 
in a manner sufficient to prevent binding of the protein to 
its sxibstrate. 

In aoiother embodiment, the substrate is immobilized to 
a support, and is contacted with both a labeled colon 
specific polypeptide and a therapeutic (or xmlabeled proteins 
and a labeled therapeutic) , and it is determined whether the 
amoxint of the colon specific polypeptide bound to the 
substrate is reduced in cottparison to the assay without the 
therapeutic added. The colon specific polypeptide may be 
labeled with antibodies. 

In another exaxtple of such a screening assay, there is 
provided a mammalian cell or membrane preparation expressing 
a colon specific polypeptide of the present invention 
incubated with elements which undergo simultaneous oxidation 
and reduction, for example hydrogen and oxygen which together 
form water, wherein the hydrogen could be labeled by 



radioactivity, e.g., tritium, in the presence of the coti^Doiind 
to be screened under conditions favoring the oxidation 
reduction reaction where hydrogen and oxygen form water. The 
ability of the cott5)Ound to block this interaction could then 
be measured. 

This invention provides a method for identification of 
the receptors for the polypeptide of the present invention. 
The gene encoding the receptor can be identified by numerous 
methods known to those of skill in the art, for example, 
ligand panning and FACS sorting (Coligan, et al , , Current 
Protocols in Iramun. , 1(2), (Chapter 5, (1991)). Preferably, 
expression cloning is employed wherein polyadenylated RNA is 
prepared from a cell 3^esponsive to the polypeptides, and a 
cDNA library created from this RNA is divided into pools and 
used to transfect COS cells or other cells that are not 
responsive to the polypeptides. Transfected cells which are 
grown on glass slides are exposed to the labeled 
polypeptides. The polypeptides can be labeled by a variety 
of means including iodination or inclusion of a recognition 
site for a site-specific protein kinase. Following fixation 
and incubation, the slides are subjected to autoradiographic 
analysis. Positive pools are identified and sub-pools are 
prepared and retransf ected using an iterative sub-pooling and 
rescreening process, eventually yielding a single clone (s) 
that encodes the putative receptor. 

As an alternative approach for receptor identification, 
the labeled polypeptides can be photoaf f inity linked with 
cell membrane or extract preparations that express the 
receptor molecule. Cross-linked material is resolved by PAGE 
analysis and exposed to X-ray film. The labeled coir5)lex 
containing the receptors of the polypeptides can be excised, 
resolved into peptide fragments, and sxibj ected to protein 
microsequencing. The amino acid sequence obtained from 
microsequencing would be used to design a set of degenerate 
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oligonucleotide probes to. screen a cDNA library to identify 
the genes encoding the putative receptors. 

Further, since the colon specific gene and gene product 
of the present invention is a growth regulator agonist and 
antagonists to the polypeptide could be determined by an 
assay comprising combining membrane preparations con^rising 
the receptor for the polypeptide euid a con5)ound to be 
screened and determining the generation of a signal from the 
receptor. In the case of determining an antagonist, the 
polypeptide of the present invention is added to the assay 
and the ability of the conqpound to compete for receptor 
sites, i.e., lack of generation of signal from receptor, 
could then be determined. 

Potential antagonists to a colon specific polypeptide 
include antibodies and anti- idiotypic antibodies as described 
above, or in some cases, an oligonucleotide, which binds to 
the polypeptide. 

Another potential antagonist is an antisense construct 
prepared using antisense technology, which is directed to a 
colon specific polynucleotide to prevent transcription. 
Antisense technology can be used to control gene expression 
through triple -helix formation or antisense DNA or RNA, both 
of which methods are based on binding ^of a polynucleotide to 
DNA or RNA. For example, the 5' coding portion of the 
polynucleotide sequence, which encodes for the mature 
polypeptides of 'the present invention, is used to design an 
antisense RNA oligonucleotide of from about 10 to 40 base 
pairs in length. A DNA oligonucleotide is designed to be 
cott5>lementary to a region of the gene involved in 
transcription (triple hel-ix -see Lee et al., Nucl. Acids 
Res., 6:3073 (1979); Cooney et al, Science, 241:456 (1988); 
and Dervan et al.. Science, 251: 1360 (1991)), thereby 
preventing transcription and the production of a colon 
specific polynucleotide. The antisense RNA oligonucleotide 
hybridizes to the mRNA in vivo and blocks translation of the 
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mRNA molecule into the colon specific gene polypeptide 
(antisense - Okano, J. Neufochem. , 56:560 (1991); 
Oligodeoxynucleotides as Antisense Inhibitors of Gene 
Expression, CRC Press, Boca Raton, FL (1988)). The 
oligonucleotides described above can also be delivered to 
cells such that the antisense RNA. or DNA may be expressed In 
vivo to inhibit production of the colon specific 
polypeptides . 

Potential antagonists also include a small molecule 
which binds to and occupies the active site of the colon 
specific polypeptide thereby making the active site 
inaccessible to substrate such that normal biological 
activity is prevented. i Examples of small molecules include 
but are not limited to small peptides or peptide-like 
molecules , 

The antagonists may be employed to treat colon cancer, 
since they interact with the function of colon specific 
polypeptides in a manner sufficient to inhibit natural 
function which is necessary for the viability of colon cancer 
cells. The antagonists may be employed in a composition with 
a pharmaceutical ly acceptable carrier, e.g., as hereinafter 
described. 

The polypeptides and auitagonists of the present 
invention may be ett5)loyed in combination with a suitable 
pharmaceutical carrier. Such cotr5)Ositions comprise a 
therapeutically effective amount of the polypeptide or 
antagonist, and a pharmaceutical ly acceptable carrier or 
excipient. Such a carrier includes but is not limited to 
saline, buffered saline, dextrose, water, glycerol, ethanol, 
and combinations thereof. The formulation should suit the 
mode of administration . 

The invention also provides a pharmaceutical pack or kit 
conprising one or more containers filled with one or more of 
the ingredients of the pharmaceutical compositions of the 
invention. Associated with such container (s) can be a notice 



in the form prescribed .by a governmental agency regulating 
the manufacture, use or sale of pharmaceuticals or biological 
products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. In 
addition, the pharmaceutical compositions may be employed in 
conjunction with other therapeutic compounds . 

The pharmaceutical cort5)Ositions may be administered in 
a convenient manner such as by the oral, topical, 
intravenous , intraperitoneal , intramuscular, subcutaneous , 
intranasal, intra-anal or intradermal routes. The 
pharmaceutical cott5>ositions are administered in an amount 
which is effective for treating and/or prophylaxis of the 
specific indication. Bn general, they are administered in an 
amount of at least about 10 fig/y^S body weight and in most 
cases they will be administered in an amount not in excess of 
about 8 mg/Kg body weight per day. In most cases, the dosage 
is from about 10 Aig/^g to about 1 mg/kg body weight daily, 
taking into account the routes of administration, sytr^jtoms, 
etc - 

The colon specific gene polypeptides and antagonists and 
agonists which are polypeptides may also be employed in 
accordance with the present invention by expression of such 
polypeptides in vivo, which is often referred to as "gene 
therapy . " 

Thus, for exanple, cells from a patient may be 
engineered with a polynucleotide (DNA or RNA) encoding a 
polypeptide ex vivo, with the engineered cells then being 
provided to a patient to be treated with the polypeptide. 
Such methods are well-known in the art. For exarqple, cells 
may be engineered by procedures known in the art by use of a 
retroviral particle containing RNA encoding a polypeptide of 
the present invention. 

Similarly, cells may be engineered in vivo for 
e:xpression of a polypeptide in vivo by, for example, 
procedures known in the art. As known in the art, a producer 



cell for producing a retroviral particle containing RNA 
encoding a polypeptide of the present invention may be 
administered to a patient for engineering cells in vivo and 
expression of the polypeptide in vivo. These and other 
methods for administering a polypeptide of the present 
invention by such method should be apparent to those skilled 
in the art from the teachings of the present invention. For 
example, the expression vehicle for engineering cells may be 
other than a retrovirus, for exan5>le, an adenovirus which may 
be used to engineer cells in vivo after combination with a 
suitable delivery vehicle. 

Retroviruses from which the retroviral plasmid vectors 
hereinabove mentioned Smay be derived include, but are not 
limited to, Moloney Murine Leukemia Virus, spleen necrosis 
virus, retroviruses such as Rous Sarcoma Virus, Harvey 
Sarcoma Virus, avian leukosis virus, gibbon ape leukemia 
vims, human immunodeficiency virus, adenovirus. 
Myeloproliferative Sarcoma Virus, and mammary tumor virus. 
In one embodiment, the retroviral plasmid vector is derived 
from Moloney Murine Leukemia Virus . 

The vector includes one or more promoters. Suitable 
promoters which may be ettployed include, but are not limited 
to, the retroviral LTR; the SV40 promoter; and the human 
cytomegalovirus (CMV) promoter described in Miller, et al., 
Biotechnioues , Vol: 7, No. 9, 980-990 (1989), or any other 
promoter (e.g., cellular promoters such as exikaryotic 
cellular promoters including, but not limited to, the 
histone, pol III, and jS-actin promoters) . Other viral 
promoters which may be enqployed include, but are not limited 
OD , adenovirus promoters , thymidine kinase (TK) promoters , 
and B19 parvovirus promoters. The selection of a suitable 
promoter will be apparent to those skilled in the art from 
the teachings contained herein. 

The nucleic acid sequence encoding the polypeptide of 
the present invention is under the control of a suitable 



promoter. Suitable promoters which may be en^loyed include, 
but are not limited to, adenoviral promoters, such as the 
adenoviral major late promoter; or hetorologous promoters, 
• such as the cytomegalovirus (CMV) promoter; the respiratory 
syncytial virus (RSV) promoter; inducible promoters, such as 
the MMT promoter, the metallothionein promoter; heat shock 
promoters; the albumin promoter; the ApoAI promoter; human 
globin promoters; viral thymidine kinase promoters, such as 
the Herpes Simplex thymidine kinase promoter; retroviral LTRs 
(including the modified retroviral LTRs hereinabove 
described) ; the j8-actin promoter; and hxaman growth hormone 
promoters. The promoter also may be the native promoter 
which controls the genfe encoding the polypeptides. 

The retroviral plasmid vector is employed to transduce 
packaging cell lines to form producer cell lines. Exaitples 
U1 of packaging cells which may be transfected include, but are 

Ri not limited to, the PE501, PA317, ^-2, ^-AM, PA12, T19-14X, 

p{ VT-19-17-H2, ^CRE, i^-CRIP, GP+E-86, GP+envAml2, and DAN cell 

5s lines as described in Miller, Human Gene Therapy, Vol. 1, 

^' pgs. 5-14 (1990), which is incorporated herein by reference 

p in its entirety. The vector may transduce the packaging 

cells through any means known in the art. Such means 
include, but are not limited to, electroporation, the use of 
liposomes, and CaP04 precipitation. In one alternative, the 
retroviral plasmid vector may be encapsulated into a 
liposome, or coupled to a lipid, and then administered to a 
host . 

The producer cell line generates infectious retroviral 
vector particles which include the nucleic acid sequence (s) 
encoding the polypeptides. Such retroviral vector particles 
then may be eti?)loyed, to transduce eukaryotic cells, either 
in vitro or in vivo. The transduced eukaryotic cells will 
express the nucleic acid sequence (s) encoding the 
polypeptide. Eukaryotic cells which may be transduced 
include, but are not limited to, embryonic stem cells, 
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embryonic carcinoma cells, as well as hematopoietic stem 

cells, hepatocytes, fibroblasts, myoblasts, keratinocytes, 

endothelial cells, and bronchial epithelial cells. 

This invention is also related to the use of a colon 

specific gene of the present invention as a diagnostic. For 

exan5)le, some diseases result from inherited defective genes. 

The colon specific gene of the present invention is 

overexpressed in colon cancer. A mutation in a colon 

specific gene of the present invention at the DNA level may 

be detected by a variety of techniques. Nucleic acids used 

for diagnosis (genomic DNA, mRNA, etc.) may be obtained from 

a patient's cells, other than from the colon, such as from 

blood, urine, saliva, J tissue biopsy and autopsy material. 

The genomic DNA may be used directly for detection or may be 

amplified enzymatically by using VCR (Saiki, et al.. Nature, 

324:163-166 (1986)) prior to analysis. RNA or cDNA may also 

be used for the same purpose. As an exan5)le, PGR primers 
tji , . 

H coital ementary to the nucleic acid of the instant xnventxon 

4- can be used to identify and analyze mutations in a colon 

specific polynucleotide of the present invention. For 

p exatrple, deletions and insertions can be detected by a change 

rf in size of the amplified product in con5)arison to the normal 

j= genotype. Point mutations can be identified by hybridizing 

P att^lified DNA to radiolabelled colon specific RNA or, 

• alternatively, radiolabelled antisense DNA sequences - 

Another well-established method for screening for 

mutations in particular segments of DNA after PGR 

an5)lif ication is single-strand conformation polymorphism 

(SSCP) analysis. PGR products are prepared for SSCP by ten 

cycles of reanqplif ication to incorporate '^P-dCTP, digested 

with an appropriate restriction enzyme to generate 200-300 bp 

fragments, and denatured by heating to 85 "C for 5 min. and 

then plunged into ice. Electrophoresis is then carried out 

in a nondenaturing gel (5% glycerol, 5% acrylamide) (Glavac, 

D. and Dean, M., Human Mutation, 2:404-414 (1993)). 
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Sequence differences between the reference gene and 
"tnutcints" may be revealed by the direct DNA sequencing 
method. In addition, cloned DNA. segments may be used as 
probes tq detect specific DNA segments. The sensitivity of 
this method is greatly enhanced when combined with PGR. For 
exaitple, a sequencing primer is used with double -stranded PGR 
product or a single -stranded template molecule generated by 
a modified PGR. The sequence determination is performed by 
conventional procedures with radiolabeled nucleotides or by 
automatic sequencing procedures with f luorescent-tags . 

Genetic testing based on DNA sequence differences may be 
achieved by detection of alteration in electrophoretic 
mobility of DNA fragmeiits and gels with or without denaturing 
agents. Small sequence deletions and insertions can be 
visualized by high- resolution gel electrophoresis. DNA 
fragments of different sequences may be distinguished on 
denaturing f ormamide gradient gels in which the mobilities of 
different DNA fragments are retarded in the gel at different 
positions according to their specific melting or partial 
melting temperatures (see, e.g., Myers, et al.. Science , 
230:1242 (1985)). In addition, sequence alterations, in 
particular small deletions, may be detected as changes in the 
migration pattern of DNA. ^ 

Sequence changes at specific locations may also be 
revealed by nuclease protection assays, such as Rnase and SI 
protection or the chemical cleavage method (e.g.. Cotton, et 
al., PNAS. USA . 85:4397-4401 (1985)). 

Thus, the detection of the specific DNA sequence may be 
achieved by methods such as hybridization, RNase protection, 
chemical cleavage, direct DNA sequencing, or the use of 
restriction enzymes (e.g.. Restriction Fragment Length 
Polymorphisms (RFLP) ) and Southern blotting. 

The sequences of the present invention are also valuable 
for chromosome identification. The sequence is specifically 
targeted to and can hybridize with a particular location on 
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an individual human chroxnosome . Moreover, there is a current 
need for identifying particular sites on the chromosome. Few 
chromosome marking reagents based on actual sequence data 
(repeat polymorphisms) are presently available for marking 
chromosomal location. The mapping of DNAs to chromosomes 
according to the present invention is an insert cint first step 
in correlating those sequences with gene associated with 
disease. 

Briefly, sequences can be mapped to chromosomes by 
preparing PGR primers (preferably 15-25 bp) from the cDNA. 
Cott5)uter analysis of the 3' untranslated region is used to 
rapidly select primers that do not span more than one exon in 
the genomic DNA, thus cpmplicating the amplification process. 
These primers are then used for PGR screening of somatic cell 
hybrids containing individual human, chromosomes . Only those 
hybrids containing the human gene corresponding to the primer 
will yield an amplified fragment. 

PGR mapping of somatic ceil hybrids is a rapid procedure 
for assigning a particular DNA to a particular chromosome. 
Using the present invention with the same oligonucleotide 
primers, sublocalization can be achieved with panels of 
fragments from specific chromosomes or pools of large genomic 
clones in an analogous manner. Other, mapping strategies that 
can similarly be used to map to its chromosome include in 
situ hybridization, prescreening with labeled flow- sorted 
chromosomes and preselection by hybridization to construct 
chromosome specif ic-cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a cDNA 
clone to a metaphase chromosomal spread can be used to 
provide a precise chromosomal location in one step. This 
technique can be used with cDNA as short as 50 or 60 bases. 
For a review of this technique^ see Verma et al.. Human 
Chromosomes: a Manual of Basic Techniques, Pergamon Press, 
New York (1988) . 
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Once a sequence has- been mapped to a precise chromosomal 
location, the physical position of the sequence on the 
chromosome can be correlated with genetic map data. Such 
data are/ found, for exaTt5>le, in V. McKusick, Mendelian 
Inheritance in Man (availsdDle on line through Johns Hopkiins 
University Welch Medical Library) . The relationship between 
gene and diseases that have been mapped to the same 
chromosomal region are then identified through linkage 
analysis (coinheritance of physically adjacent genes) , 

Next, it is necessary to determine the differences in 
the cDNA or genomic sequence between affected auid \inaf fected 
individuals. If a mutation is observed in some or all of the 
affected individuals biit not in ciny normal individuals, then 
the mutation is likely to be the causative agent of the 
y disease. 

With current resolution of physical mapping and genetic 
fU mapping techniques, a cDNA precisely localized to a 

H chromosomal region associated with the disease could be one 

is of between 50 and 500 potential causative genes. (This 

assumes 1 megabase mapping resolution and one gene per 20 

b kb) - 

MJ The polypeptides, their fragments or other derivatives, 

> or analogs thereof, or cells expressihg them can be used as 

p an immunogen to produce antibodies thereto. These antibodies 

M can be, for exatt^rle, polyclonal or monoclonal antibodies. 

The present invention also includes chimeric, single chain, 
and hximanized antibodies, as well as Fab fragments, or the 
product of an Fab expression library. Various procedures 
known in the art may be used for the production of such 
antibodies and fragments. ~ 

Antibodies generated against the polypeptides 
corresponding to a sequence of the present invention can be 
obtained by direct injection of the polypeptides into an 
animal or by administering the polypeptides to an animal, 
preferably a nonhuman. The antibody so obtained will then 
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bind the polypeptides . itself. In this manner, even a 
sec[uence encoding only a fragment of the polypeptides can be 
used to generate antibodies binding the whole native 
polypeptides . Such antibodies can then be used to isolate 
the polypeptide from tissue expressing that polypeptide. 

For preparation of monoclonal antibodies, any technique 
which provides antibodies produced by continuous cell line 
cultures can be used. Exatrples include the hybridoma 
technique (Kohler and Milstein, 1975, Nature, 256:495-497), 
the trioma technique, the human B-cell hybridoma technique 
(Kozbor et al., 1983, Immunology Today 4:72), and the EBV- 
hybridoma technique to produce human monoclonal antibodies 
(Cole, et al., 1985, jin Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). 

Techniques described for the production of single chain 
antibodies (U.S. Patent 4,946,778) can be adapted to produce 
single chain antibodies to immunogenic polypeptide products 
of this invention. Transgenic mice may also be used to 
generate antibodies . 

The antibodies may also be ett5>loyed to target colon 
cancer cells, for example, in a method of homing interaction 
agents which, when contacting colon cancer cells, destroy 
them. This is true since the antibodies are specific for the 
colon specific polypeptides of the present invention. A 
linking of the interaction agent to the antibody would cause 
the interaction agent to be carried directly to the colon. 

Antibodies of this type may also be used to do in vivo 
imaging, for exanqple, by labeling the antibodies to 
facilitate scanning of the pelvic area and the colon. One 
method for imaging comprises contacting any cancer cells of 
the colon to be imaged with an anti-colon specific protein- 
antibody labeled with a detectable marker. The method is 
performed under conditions such that the labeled antibody 
binds to the colon specific polypeptides. In a specific 
exatt?)le, the antibodies interact with the colon, for exanple. 
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colon cancer cells, and fluoresce upon contact such that 
imaging cind visibility of the colon are enhanced to allow a 
determination of the diseased or non-diseased state of the 
colon. 

The' present invention will be further described with 
reference to the following exanqples; however, it is to be 
landerstood that the present invention is not limited to such 
exan5)les. All parts or amounts, unless otherwise specified, 

are by weight. 

In order to facilitate xinderstanding of the following 
exanples certain frequently occurring methods and/or terms 
will be described. 

"Plasmids" are designated by a lower case p preceded 
and/or followed by capital letters and/ or numbers. The 
starting plasmids herein are either commercially available, 
publicly available on an unrestricted basis, or can be 
constructed from available plasmids in accord with p\ablished 
procedures. In addition, equivalent plasmids to those 
described are known in the art and will be apparent to the 
ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the 
DNA with a restriction enzyme that acts only at certain 
sequences in the DNA. The various restriction enzymes used 
herein are commercially available and their reaction 
conditions, cof actors and other requirements were used as 
would be known to the ordinarily skilled artisan. For 
analytical purposes, typically l ^9 of plasmid or DNA 
fragment is used with about 2 units of enzyme in about 20 ^1 
of buffer solution. For the purpose of isolating DNA 
fragments for plasmid construction, typically 5 to 50 of 
DNA are digested with 20 to 250 units of enzyme in a larger 
volume. Appropriate buffers and substrate amounts for 
particular restriction enzymes are specified by the 
manufacturer. Incubation times of about 1 hour at 37 'C are 
ordinarily used, but may vary in accordance with the 
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supplier' s instructions.. . After digestion the reaction is 
electrophoresed directly on a polyacrylamide gel to isolate 
the desired fragment. 

Size separation of the cleaved fragments is performed 
using 8 percent polyacrylamide gel described by Goeddel, D. 
et al.. Nucleic Acids Res., 8:4057 (1980). 

" Oligonucleotides" refers to either a single stranded 
polydeoxynucleotide or two complementary polydeoxynucleotide 
streuids which may be chemically synthesized. Such synthetic 
oligonucleotides have no 5' phosphate and thus will not 
ligate to another oligonucleotide without adding a phosphate 
with an ATP in the presence of a kinase. A synthetic 
oligonucleotide will legate to a fragment that has not been 
dephosphorylated . 

"Ligation" refers to the process of forming 
phosphodiester bonds between two doxable stranded nucleic acid 
fragments (Maniatis, T. , et al.. Id., p. 146). Unless 
otherwise provided, ligation may be accort^jlished using known 
buffers and conditions with 10 \anits of T4 DNA ligase 
("ligase") per 0.5 fig of approximately equimolar amounts of 
the DNA fragments to be ligated. 

Unless otherwise stated, transformation was performed as 
described in the method of Graham, F.. and Van der Eb, A., 
Virology, 52:456-457 (1973). 

Example 1 

Determination of Transcription o f the colon specific gene 

To assess the presence or absence of active 
transcription of a colon specific gene RNA, approximately 6 
ml of venous blood is obtained with a standard venipuncture 
technique using heparinized tubes. Whole blood is mixed with 
an equal volume of phosphate buffered saline, which is then 
layered over 8 ml of Ficoll (Pharmacia, Uppsala, Sweden) in 
a 15 -ml polystyrene tiobe. The gradient is centrifuged at 
1800 X g for 20 min at 5«»C. The lymphocyte and granulocyte 



layer (approximately 5 ml) is carefully aspirated and 
rediluted up to 50 ml with phosphate-buf f ered saline in a 50- 
ml tube, which is centrifuged again at 1800 X g for 2 0 min. 
at 5**C. / The supernatant is discarded and the pellet 
containing nucleated cells is used for UNA extraction using 
the RNazole B method as described by the manufacturer (Tel- 
Test Inc., Friendswood, TX) . 

To determine the quantity of mRNA from the gene of 
interest, a probe is designed with an identity to the mRNA 
sequence transcribed from a hvmian gene whose coding portion 
includes a DNA sequence of Figure 1. This probe is mixed 
with the extracted RNA and the mixed DNA and RNA are 
precipitated with ethajiol -70*»C for 15 minutes) • The pellet 
is resuspended in hybridization buffer and dissolved. The 
tubes containing the mixture are incubated in a 72*^C water 
bath for 10-15 mins. to denature the DNA. The tubes are 
rapidly transferred to a water bath at the desired 
hybridization temperature. Hybridization textqperature depends 
on the G + C content of the DNA. Hybridization is done for 
3 hrs. 0.3 ml of nuclease-Sl buffer is added and mixed well. 
50 fjLl of 4.0 M ammonium acetate and 0.1 M EDTA is added to 
stop the reaction. The mixture is extracted with 

phenol/chloroform and 20 /ig of carrier tRNA is added and 
precipitation is done with an equal volume of isopropanol. 
The precipitate is- dissolved in 40 fil of TE (pH 7.4) and run 
on an alkaline agarose gel. Following electrophoresis, the 
RNA is microsequenced to confirm the nucleotide sequence. 
(See Favaloro, J. et al.. Methods Enzymol., 65:718 (1980) for 
a more detailed review) . 

Two oligonucleotide primers are employed to an?)lify the 
sequence isolated by the above methods. The 5' primer is 20 
nucleotides long and the 3' primer is a complimentary 
sequence for the 3' end of the isolated mRNA. The primers 
are custom designed according to the isolated mRNA. The 
reverse transcriptase reaction and PCR anplif ication are 



performed sequentially without interruption in a Perkin Elmer 
9600 PGR machine (Emeryville, CA) . Four hundred ng total RNA 
in 20 /il diethylpyrocarbonate- treated water are placed in a 
65° C water bath for 5 min. and then quickly chilled on ice 
immediately prior to the addition of PGR reagents. The 50-^1 
total PGR volume consisted of 2.5 \inits Taq polymerase 
(Perkin-Elmer) . 2 iinits avian myeloblastosis virus reverse 
treinscriptase (Boehringer Mannheim, Indianapolis, IN) ; 200 nM 
each of dCTP, dATP, dGTP and dTTP (Perkin Elmer) ; 18 pM each 
primer, 10 mM Tris-HCl; 50 mM KCl; and 2 mM MgClj (Perkin 
Elmer). PGR conditions are as follows: cycle 1 is 42''C for 
15 min then 97°G for 15 s (1 cycle); cycle 2 is 95«»G for 1 
min. 60«»G for 1 min, a«id 72«»G for 30 s (15 cycles) ; cycle 3 
is 95°G for 1 min. 6Q°C for 1 min., and 72«»G for 1 min. (10 
cycles); cycle 4 is SB^C for 1 min., eo-G for 1 min., and 
72«'G for 2 min. (8 cycles) ; cycle 5 is 72°C for 15 min. (1 
cycle); and the final cycle is a 4»G hold until sample is 
taken out of the machine. The 50 -/il PGR products are 
concentrated down to 10 /il with vacuum centrifugation, and a 
sait?)le is then run on a thin 1.2 % Tris-borate-EDTA agarose 
gel containing ethidium bromide. A band of expected size 
would indicate that this gene is present in the tissue 
assayed. The amovint of RNA in the pellet may be quantified 
in numerous ways, for exaii5)le, it may be weighed. 

verification -of the nucleotide sequence of the PGR 
products is done by microsequencing. The PGR product is 
purified with a Qiagen PGR Product Piirif ication Kit (Qiagen, 
Ghatsworth, GA) as described by the manufacturer. One ng of 
the PGR product undergoes PGR sequencing by using the Taq 
DyeDeoxy Terminator Gycle sequencing kit in a Perkin-Elmer 
9600 PGR machine as described by Applied Biosystems (Foster, 
GA) . The sequenced product is purified using Gentri-Sep 
columns (Princeton Separations, Adelphia, NJ) as described by 
the cotipany. This product is then analyzed with an ABI model 
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373A DNA sequencing system (Applied Biosystems) integrated 
with a Macintosh Ilci ■con5>uter. 



Example 2 

Bacterial Expression and Purification of the colon specific 
crene protein and Use For Preparin g a Monoclonal Antibody 

The DNA sequence encoding a polypeptide of the present 
invention, ATCC # 97129, is initially amplified using PGR 
oligonucleotide primers corresponding to the 5' sequences of 
the processed protein (minus the signal peptide sequence) and 
the vector sequences 3' to the gene. Additional nucleotides 
corresponding to the DNA sequence are added to the 5 ' and 3 ' 
sequences respectively. The 5' oligonucleotide primer. 
GCAGGATCCTGGCrrCCAGAAGCATG (BAMHI) (SEQ ID NO: 3) may contain, 
for exatt?)le. a restriction enzyme site followed by 
nucleotides of coding sequence starting from the presumed 
terminal amino acid of the processed protein. The 3' 
sequence TACGGGTACCTTGCTCTATGGTCGGTAC (ASP718) (SEQ. ID NO: 4) 
may, for exan5>le, contain complementary sequences to a 
restriction enzyme site and also be followed by nucleotides 
of the nucleic acid sequence encoding the protein of 
interest. The restriction enzyme sites correspond to the 
restriction enzyme sites on a bacterial expression vector, 
for example, pQE-32 (Qiagen, inc. C3aatsworth, CA) . PQE-32 
encodes antibiotic resistance (Amp') , a bacterial origin of 
replication (ori) , an iPTG-regulatable promoter operator 
(P/O), a ribosome binding site (RBS) , a 6 -His tag and 
restriction enzyme sites. pQB-9 is then digested with the 
restriction enzymes corresponding to restriction enzyme sites 
contained in he primer sequences. The amplijied sequences 
are ligated " into pQE-9 and inserted in frame with the 
sequence encoding for the histidine tag and the RBS. The 
ligation mixture is then used to transform an E. coli strain, 
for example, MlS/rep 4 (Qiagen) by the procedure described in 
Sambrook, J. et al.. Molecular Cloning: A Laboratory Manual, 
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Cold Spring Laboratory . Press , (1989). Mi5/rep4 contains 
multiple copies of the plasmid pREP4, which expresses the 
lad repressor and also confers kanamycin resistance (Kan') , 
Transfonrieints are identified by their ability to grow on LB 
plates and airpicillin/kanamycin resistant colonies are 
selected. Plasmid DNA is isolated and confirmed by 
restriction analysis. Clones containing the desired 

constructs are grovm overnight (0/N) in liquid culture in LB 
media supplemented with both Ajxsp (100 ug/ml) and Kan (25 
ug/ml) . The O/N culture is used to inoculate a large culture 
at a ratio of 1:100 to 1:250. The cells are grown to an 
optical density 600 (O.D."°) of between 0.4 and 0.6. IPTG 
("Isopropyl-B-D-thiogaJ-acto pyranoside") is then added to a 
final concentration of 1 inM. IPTG induces by inactivating 
the lad repressor, clearing the P/0 leading to increased 
gene expression. Cells are grown an extra 3 to 4 hours. 
Cells are then harvested by centrifugation. The cell pellet 
is solubilized in the chaotropic agent 6 Molar Guanidine HCl. 
After clarification, solubilized protein is purified from 
this solution by chromatography on a Nickel -Chelate column 
under conditions that allow for tight binding by proteins 
containing the 6 -His tag (Hochuli, E. et al., J. 
Chromatography 411:177-184 (1984)). ^The protein is eluted 
from the column in 6 molar guanidine HCl pH 5 . 0 and for the 
purpose of renaturation adjusted to 3 molar guanidine HCl, 
lOOmM sodium phosphate, 10 tnmolar glutathione (reduced) and 
2 tnmolar glutathione (oxidized) . After incubation in this 
solution for 12 hours the protein is dialyzed to 10 mmolar 
sodiiun phosphate. 

The protein purified in this manner may be used as an 
epitope to raise monoclonal antibodies specific to such 
protein. The monoclonal antibodies generated against the 
polypeptide the isolated protein can be obtained by direct 
inject ioh of the polypeptides into an animal or by 
administering the polypeptides to an animal. The antibodies 
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so obtained will then - bind to the protein itself. Such 
antibodies can then be used to isolate the protein from 
tissue expressing that polypeptide by the use of an, for 
example, /ELISA assay. 

Example 3 
Expression via Gene Therapy 

Fibroblasts are obtained from a subject by skin biopsy. 
The resulting tissue is placed in tissue -culture medium and 
separated into small pieces. Small chunks of the tissue are 
placed on a wet surface of a tissue culture flask, 
approximately ten pieces are placed in each flask. The flask 
is turned upside down, closed tight and left at room 
tenperature over nighti After 24 hours at room temperature, 
the flask is inverted and the chunks of tissue remain fixed 
to the bottom of the flask and fresh media (e.g.. Ham's F12 
media, with 10% FBS, penicillin and streptomycin, is added. 
This is then incubated at 37«C for approximately one week. 
At this time, fresh media is added and subsequently changed 
every several days. After an additional two weeks in 
culture, a monolayer of fibroblasts emerge. The monolayer is 
trypsinized and scaled into larger flasks. 

pMV-7 (Kirschmeier, P.T. et al, DNA, 7:219-25 (1988) 
flanked by the long terminal repeats^ of the Moloney murine 
sarcoma virus, is digested with EcoRI and Hindlll and 
subsequently treated with calf intestinal phosphatase. The 
linear vector is fractionated on agarose gel and purified, 
using glass beads. 

The cDNA encoding a polypeptide of the present invention 
is attplified using PGR primers which correspond to the 5' and 
3' end sequences respectively. The 5' primer contains an 
EcoRI site and the 3' primer contains a Hindlll site. Equal 
quantities of the Moloney murine sarcoma virus linear 
backbone and the EcoRI and Hindlll fragment are added 
together, in the presence of T4 DNA ligase. The resulting 
mixture is maintained iinder conditions appropriate for 



-44- 



ligation of the two fragments. The ligation mixture is used 
to transform bacteria HBlOl, which are then plated onto agar- 
containing kanamycin for the purpose of confirming that the 
vector had the gene of interest properly inserted. 

The atnphotropic pA3l7 or GP+aml2 packaging cells are 
grown in tissue culture to confluent density in Dulbecco's 
Modified Eagles Medium (DMEM) with 10% calf serxim (CS) , 
penicillin and streptomycin. The MSV vector containing the 
gene is then added to the media and the packaging cells are 
transduced with the vector. The packaging cells now produce 
infectious viral particles containing the gene (the packaging 
cells are now referred to as producer cells) . 

Fresh media is added to the transduced producer cells, 
and sxibsequently, the media is harvested from a 10 cm plate 
Ji of confluent producer cells. The spent media, containing the 

infectious viral particles, is filtered through a millipore 
|!f filter to remove detached producer cells and this media is 

Ci then used to infect fibroblast cells. Media is removed from 

a sub- confluent plate of fibroblasts and quickly replaced 
with the media from the producer cells. This media is 
~5 removed and replaced with fresh media. If the titer of virus 

J is high, then virtually all fibroblasts will be infected and 

|2 no selection is required. If the titer is very low, then it 

^5 is necessary to use a retroviral vector that has a selectable 

marker, such as neo or his . 

The engineered fibroblasts are then injected into the 
host, either alone or after having been grown to confluence 
on cytodex 3 microcarrier beads . The fibroblasts now produce 
the protein product . 

— Example 4 

Cloning and expression of colon specific gene polypeptide 
using the baculovirus expression svstem 

The DNA sequence encoding the full length protein, ATCC 
# 97129, was anplified using PGR oligonucleotide primers 
corresponding to the 5' and 3' sequences of the gene: 
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The 5' primer has the sequence 5' ATCGGGATCCGCCATCATG 
GCTTCCAGAAGCATGCG (SEQ ID NO: 5) and contains a BamHI 
restriction enzyme site (in bold) followed by 6 nucleotides 
resembling an efficient signal . for the initiation of 
translation in eukaryotic cells (Kozak, M., J. Mol. Biol., 
196:947-950 (1987) which was just behind the first 20 
nucleotides of the colon specific gene (the initiation codon 
for translation "ATG" was underlined) . 

The 3 ' primer has the sequence 5 ' TACGGGTACCTTGCTC 
TATGGTCGGTAC 3' (SEQ ID NO: 6) and contains the cleavage site 
for the restriction endonuclease Asp718 and 5 nucleotides 
coTt^lementary to the 3 ' non-translated sequence of the colon 
specific gene. The anqplified sequences were isolated from a 
1% agarose gel using a commercially available kit 
("Geneclean, " BIO 101 Inc., La Jolla, Ca.) . The fragment was 
then digested with the endonucleases BamHI and Asp718 and 
then purified again on a 1% agarose gel . This fragment was 
designated F2 . 

The vector pA2 (modification of pVL941 vector, discussed 
below) was used for the expression of the colon specific 
protein using the baculovirus expression system (for review 
see: Summers, M.D. and Smith, G.E. 1987, A manual of methods 
for baculovirus vectors and insect cell culture procedures, 
Texas Agricultural Experimental Station Bulletin NO: 1555) . 
This e3q)ression ^vector contains the strong polyhedrin 
promoter of the Autographa califomica nuclear polyhedrosis 
virus (AcMNPV) followed by the recognition sites for the 
restriction endonucleases BamHI and Asp7l8. The 
polyadenylation site of the simian virus (SV)40 was used for 
efficient polyadenylation. For an easy selection of 
recombinant viruses the beta-galactosidase gene from E.coli 
was inserted in the same orientation as the polyhedrin 
promoter followed by the polyadenylation signal of the 
polyhedrin gene. The polyhedrin sequences were flanked at 
both sides by viral sequences for the cell -mediated 
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homologous recombination . of cotransf ected wild-type viral 
DNA. Many other baculovirus vectors could be used in place 
of pRGl such as pAc3 73, pVL941 and pAcIMl (Luckow, V.A, and 



The plasmid was digested with the restriction enzymes 
and then dephosphorylated using calf intestinal phosphatase 
by procedures known in the art. The DNA was then isolated 
from a 1% agarose gel using the commercially available kit 

("Geneclean" BIO 101 Inc., La Jolla, Ca.). This vector DNA 
was designated V2 . 

Fragment F2 and the dephosphorylated plasmid V2 were 
ligated with T4 DNA ligase. DH5a cells were then transformed 
and bacteria identified that contained the plasmid (pBac- 
colon specific polypepitde) with the colon specific gene 
using the enzymes BamHI and Asp718. The sequence of the 
cloned fragment was confirmed by DNA sequencing, 

5 iig of the plasmid pBac -colon specific gene was 
cotransf ected with 1.0 /ig of a commercially available 
linearized baculovirus ( "BaculoGold™ baculovirus DNA", 
Pharmingen, San Diego, CA.) using the lipofection method 

(Feigner et al. Proc. Natl. Acad. Sci. USA, 84:7413-7417 

(1987) ) . 

l^g of BaculoGold™ virus DNA arid 5 fig of the plasmid 
pBac -colon specific gene were mixed in a sterile well of a 
microtiter plate -containing 50 /xl of serxam free Grace's 
medium (Life Technologies Inc., Gaithersburg, MD) . 
Afterwards 10 /xl Lipofectin plus 90 /xl Grace's mediiam were 
added, mixed and incubated for 15 minutes at room 
tett5>erature . Then the transfection mixture was added 
dropwise to the Sf 9 insect cells (ATCC CRL 1711) seeded in a 
35 ram tissue culture plate with 1ml Grace's mediiam without 
serum. The plate was rocked back and forth to mix the newly 
added solution. The plate was then incubated for 27 hours at 
27«C. After 5 hours the transfection solution was removed 
from the plate and 1 ml of Grace's insect medium supplemented 



Summers, .M.D 



Virology, 170:31-39) . 



with 10% fetal calf senan was added. The plate was put back 
into an incubator and cultivation continued at 27**C for four 
days . 

After four days the supernatant was collected and a 
plaque assay performed similar as described by Summers and 
Smith (supra) . As a modification an agarose gel with "Blue 
Gal" (Life Technologies Inc., Gaithersburg) was used which 
allows an easy isolation of blue stained plaques. (A 
detailed description of a "plaque assay" can also be found in 
the user's guide for insect cell culture and baculovirology 
distributed by Life Technologies Inc., Gaithersburg, page 9- 
10) . 

Four days after tihe serial dilution, the viruses were 
added to the cells and blue stained plaques were picked with, 
the tip of an Eppendorf pipette. The agar containing the 
recombinant viruses was then resuspended in an Eppendorf tube 
containing 200 fxl of Grace's medium. The agar was removed by 
a brief centrifugation and the supernatant containing the 
recombinant baculovirus was used to infect Sf9 cells seeded 
in 35 ram dishes. Four days later the supematants of these 
culture dishes were harvested and then stored at 4°C. 

Sf9 cells were grown in Grace's medium supplemented with 
10% heat -inactivated FBS . The cells were infected with the 
recombinant baculovirus V-colon specific gene at a 
multiplicity of infection {MOD of 2. Six hours later the 
medium was removed and replaced with SF900 II medium minus 
methionine and cysteine (Life Technologies Inc., 
Gaithersburg) . 42 hours later 5 ^Ci of "S -methionine and 5 
/zCi ^^S cysteine (Amersham) were added. The colon specific 
protein was purified from "infected cells 72 hours post 
infection by cell lysis in hypotonic phosphate buffer and 
further purified by ion exchange chromatography, size 
exclusion chromatography, and reverse phase chromatography. 

Numerous modifications and variations of the present 
invention were possible in light of the above teachings and. 
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therefore, within the .scope of the appended claims, the 
invention may be practiced otherwise than as particularly 
described. 
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(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: =FERRARO, GREGORY D. 

(B) REGISTRATION NUMBER: 36,134 

(C) REFERENCE /DOCKET NUMBER: 325800-447 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-994-1700 

(B) TELEFAX: 201-994-1744 

(2) INFORMATION FOR SBQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1114 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNBSS : SINGLE 
•(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 
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• (xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

rrT^rrj^GGCC AAACAGATTT GCAGATCAAG GAGAACCCAG GAGTITCAAA GAAGOTCTAG 

EE! sfc?iss? ii^^ ss^c siisss 

isss -™ sg^c^ 
i^E ?s!tsss s?ssrc? 
lEE lEk ?s??s?g 

Sil ii^c ^5^^= 
iEi lEE ?s25g?s ^?ssss ^^c^c iii^ 

5?S?S§- ISSS^ -S?2?S S5f=???g *«..-ax=t io»o 

GGGCCATACA TTCCTITAAT AAACCATTGT GTAC 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 158 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNESS : 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Ser Arg Ser Met Arg Leu Leu Leu l.eu Leu Ser Cys Leu 

Ala Lys Thr Gly Va! Leu Gly Asp He lie Met Arg Pro Ser Cys 

20 

Ala Pro Gly Trp Phe Tyr His Lys Ser Asn Cys Tyr Gly Tyr Phe 

Arg Lys Leu Arg Asn Trp Ser Asp Ala Glu Leu Glu Cys Gin Ser 

50 -1 
Tyr Gly Asn Gly Ala His Leu Ala Ser lie Leu Ser Leu Lys Glu 

Ala Ter Thr He Ax! Glu Tyr He Ser Gly Tyr Gin Arg Ser Gin 

80 °^ 
Pro He Trp He Gly Leu His Asp Pro Gin Lys Arg Gin Gin Trp 

Gin Trp He Asp G^y Ala Met Tyr Leu Tyr Arg Ser Trp Ser Gly 

1 1 Q lib 
Lys Ser Met Gly Gly Asn Lys His Cys Ala Glu Met Ser Ser Asn 

Asn Asn Phe Leu t" Trp Ser Ser Asn Glu Cys Asn Lys Arg Gin 

140 145 
His Phe Leu Cys Lys Tyr Arg Pro 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE^ CHARACTERISTICS 

(A) LENGTH: 26 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDBDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

GCAGGATCCT GGCTTCCAGA AGCATG 
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(2) INFORMATION FOR SEQ ID NO: 4: 

C3 (i) SEQUENCE CHARACTERISTICS 

45 (A) LENOTH: 28 BASE PAIRS 

Ul (B) TYPE: NUCLEIC ACID 

Hi (C) STRANDBDNESS: SINGLE 

jjl (D) TOPOLOGY: LINEAR 

J? (ii) MOLECULE TYPE: Oligonucleotide 

r (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

hi TACGGGTACC TTGCTCTATG GTCGGTAC 



(2) INFORMATION FOR |?Q ID NO: 5: 

(i) SBQXJENCE CHARACTERISTICS 

(A) LENGTH: 36 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDBDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

ATCGGGATCC GCCATCATGG CTTCCAGAAG CATGCG 



( 2 ) INFORMATION FOR SEQ ID NO : 6 : 
(i) SEQUENCE CHARACTERISTICS 
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(A) LENGTH: 28 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDBDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 6 

TACGGGTACC TTGCTCTATG GTCGGTAC 
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